Conformation-dependent ligand hot spots in the spliceosomal RNA helicase BRR2

Fragment derivatives occupy diverse binding sites in two crystal forms of the RNA helicase BRR2 that coincide with predicted, conformation-dependent ligand hot spots.


Introduction
Fragment-based drug discovery (FBDD) involves as an initial step the screening of small libraries of fragments of comparably low molecular weight (typically below 300 Da) for binding to a target protein (Scott et al., 2012;Congreve et al., 2003). FBDD offers an alternative to the high-throughput screening of large collections of higher molecular-weight compounds based on in vitro activity, binding or biological effects. Due to their low molecular weight and reduced combinatorial complexity with respect to chemical functionalities, fragments show higher probabilities of protein binding and can be used to probe chemical space more efficiently and densely than is possible with larger compounds (Leach et al., 2006). Furthermore, the optimization of fragments usually gives rise to leads with a lower lipophilicity, higher water solubility and improved cell permeability, whereas high attrition rates are often observed during the optimization of higher molecular-weight compounds (Murray et al., 2012;Lipinski et al., 2001;Teague et al., 1999;Congreve et al., 2003). As the affinity of fragments to target proteins is typically low (usually in the range 0.1-10 mM), sensitive biophysical methods are required for the detection of fragment binding (Schiebel et al., 2016). In recent years, X-ray crystallography has emerged as one of the preferred methods for detecting initial fragment hits, as it (i) can nowadays be conducted in a high-throughput format, (ii) allows the identification of weakly binding substances due to the high substance concentrations that can be employed and (iii) provides immediate insights into the binding poses of the compounds with the target protein (Schiebel et al., 2016;Chilingaryan et al., 2012).
During elaboration of the initial hits, the binding mode of analogues with the same core scaffold is expected to be retained. A conserved binding mode is indeed observed in many cases (Murray et al., 2012). For example, fragments derived from the chitinase inhibitor argifin (Andersen et al., 2008) and fragments binding to the Src kinase showed the same interaction modes in structures compared with the intact inhibitors (Lange et al., 2003). However, conservation of the binding modes is not always observed in inhibitor-to-fragment deconstruction studies. For example, after the deconstruction of nine inhibitors of Bcl-xL into 22 fragments, three of the fragments showed no interaction with the protein, the binding of six fragments could not be detected via NMR chemical shift changes and one fragment led to chemical shift changes in different regions of the protein, suggesting that it bound at a different position (Barelier et al., 2010). A particularly remarkable example is afforded by the deconstruction of an inhibitor of the AmpC -lactamase into three fragments, none of which bound to the original position in crystal structures (Babaoglu & Shoichet, 2006). More recently, 21 fragments were generated from three inhibitors of the HIV-1 reverse transcriptase. Only nine of them could be detected as binders in surface plasmon resonance (SPR) measurements; however, all of these compounds were above the typical fragment size, whereas all fragment-sized molecules did not interact (Brandt et al., 2011). Likewise, the binding of fragments derived from an MDM2 inhibitor, nutlin, to the target protein was not detected when probed by SPR and NMR, and only larger portions resulted in detectable binding affinities (Fry et al., 2013). Furthermore, the decomposition of the substrates of six enzymes into 41 compounds did not result in detectable binding or inhibitory activity for most of the fragments (Barelier et al., 2014).
To identify features that contribute to changed binding modes, Malhotra and coworkers conducted a large-scale analysis of Protein Data Bank (PDB) entries for proteins in complex with ligands or in complex with smaller substructures, observing changed binding modes with a surprisingly high frequency (14%; Malhotra & Karanicolas, 2017). The analysis suggested that compounds with lower molecular weight, weaker protein binding, higher polarity and higher rigidity were more prone to show binding-mode alterations upon enlargement (Malhotra & Karanicolas, 2017). In another large-scale study of PDB structures, Drwal and coworkers noted an average conservation of above 80% of the binding modes of smaller ligands compared with the binding modes of larger ligands with similar (but not identical) substructures in almost 75% of the compared pairs (Drwal et al., 2018). The authors further reported that the binding mode tends to be conserved for fragments with a molecular weight above 150 Da (Drwal et al., 2018). Increased flexibility of the protein in the ligand-binding site was thought to contribute to changes in binding modes (Drwal et al., 2018). In an attempt to analyse the protein properties that are associated with changed fragment-binding modes, Kozakov and coworkers surveyed the overlap of fragment-binding sites with binding hot spots, i.e. regions contributing a substantial amount of free energy upon binding, and suggested that the strength of a hot spot and its overlap with a fragment-binding site are critical for binding-mode conservation (Kozakov, Hall, Jehle et al., 2015).
Here, we report a test case in which we probed the binding of compounds derived from an initial fragment hit for the spliceosomal RNA helicase BRR2 in one crystal form to the protein with an altered conformation in another crystal form. BRR2 is responsible for the unwinding of U4/U6 di-small nuclear (sn) RNAs, an essential step for the conversion of a pre-catalytic spliceosome (B complex) to an activated spliceosome (B act complex; Raghunathan & Guthrie, 1998;Laggerbauer et al., 1998). The $250 kDa enzyme is composed of an N-terminal auto-regulatory region of about 400 residues and tandem helicase cassettes, each comprising two RecA-like domains (RecA1 and RecA2), a winged-helix (WH) domain, a helical bundle (HB) or ratchet domain, a helix-loop-helix (HLH) domain and an immunoglobulin-like (IG) domain (Absmeier et al., 2015;Santos et al., 2012;Pena et al., 2009;Fig. 1a). Due to its essential role in each splicing event and its association with the genetically inherited visual deficiency retinitis pigmentosa (Zhao et al., 2009;Ledoux & Guthrie, 2016;Rů žičková & Staně k, 2017), BRR2 is an interesting target for the development of small-molecule modulators. In our study, nine fragment derivatives with the same core structure showed drastic alterations in their binding modes or locations, occupying six different pockets on the protein upon conformational changes affecting the original fragmentbinding site. Some of the fragment derivatives could be interesting starting points for further optimization into novel splicing modulators. Our study provides a test case for fragment elaboration on a large protein with a high number of potential ligand-binding pockets and high intrinsic flexibility.

Protein production and purification
Proteins were produced via recombinant baculoviruses in insect cells as described previously (Mozaffari-Jovin et al., 2013;Vester et al., 2020). Briefly, bacmids encoding an N-terminally truncated human BRR2 (hBRR2) variant (residues 395-2129; hBRR2 T1 ) fused to an N-terminal TEVcleavable His tag or to a C-terminal Jab1/MPN region of the hPRPF8 protein lacking a hBRR2-inhibitory C-terminal tail (residues 2064-2320; hJab1 ÁC ) fused to an N-terminal PreScission-cleavable GST tag were transfected into Sf9 cells for the production of first virus generations. The viruses were used to infect Sf9 cells to generate the second virus generations. Proteins were produced on a large scale upon the infection of High Five cells with the second generation of viruses.
hBRR2 T1 and hJab1 ÁC proteins were produced and purified as described previously (Mozaffari-Jovin et al., 2013;Vester et al., 2020)  Crystal structures of hBRR2 T1 and the binding site of sulfaguanidine. (a) Domain scheme of hBRR2 T1 . hBRR2 T1 comprises tandem helicase cassettes (NC, N-terminal helicase cassette; CC, C-terminal helicase cassette), each containing dual RecA-like domains (RecA1 and RecA2), a winged-helix (WH) domain, a helical bundle (HB) domain, a helix-loop-helix (HLH) domain and an immunoglobulin-like (IG) domain. The HB, HLH and IG domains form Sec63 homolog units. (b) Crystal structure of isolated hBRR2 T1 in complex with sulfaguanidine (boxed). Unless mentioned otherwise, in this and the following figures or figure panels, hBRR2 domains are coloured according to the scheme in (a). (c) Sulfaguanidine binding pocket in isolated hBRR2 T1 . Residues of hBRR2 T1 involved in the binding of sulfaguanidine are shown as sticks and are coloured by atom type. Unless mentioned otherwise, in this and the following figures or figure panels, protein C atoms are coloured as the respective protein region, compound C atoms in light grey, N atoms in blue, O atoms in red and S atoms in yellow; violet dashed lines represent van der Waals interactions or -stacking interactions and yellow dashed lines represent hydrogen bonds. (d) Crystal structure of the hBRR2 T1 -hJab1 ÁC complex (PDB entry 6s8q). Unless mentioned otherwise, in this and the following figures or figure panels, hJab1 ÁC is shown in gold. (e) Selected interatomic distances between residues of the NC and the CC in the hBRR2 T1 -sulfaguanidine complex structure (left panel) and in the hBRR2 T1 -hJab1 ÁC structure (right panel) are shown as pink dashed lines. Bound sulfaguanidine was omitted from the left panel.
200 mM NaCl, concentrated to 10 mg ml À1 , aliquoted and flash-frozen in liquid nitrogen. For the hBRR2 T1 protein used for activity assays, the His tag was not cleaved and the final SEC step was conducted with 10 mM Tris-HCl pH 7.5, 200 mM NaCl, 10%(v/v) glycerol. The hJab1 ÁC protein was purified via glutathione (GSH) beads (GE Healthcare), buffer-exchanged by SEC, dialyzed overnight including cleavage of the GST tag and passed over GSH beads again. After SEC on a Superdex 75 16/60 column (GE Healthcare) in 10 mM Tris-HCl pH 8.0, 150 mM NaCl, the protein was concentrated to 4 mg ml À1 , aliquoted and flash-frozen in liquid nitrogen. The hBRR2 T1 -hJab1 ÁC complex was formed by mixing purified hBRR2 T1 with a 1.5-molar excess of purified hJab1 ÁC , followed by SEC on a Superdex 200 10/300 Global Increase column (GE Healthcare) in 20 mM Tris-HCl pH 8.0, 150 mM NaCl, concentrating to 6 mg ml À1 , aliquoting and flash-freezing in liquid nitrogen.
The initial sulfaguanidine [4-amino-N-(aminoiminomethyl)benzolsulfonamide] hit was obtained by co-crystallization of hBRR2 T1 with a cocktail of additives (Silver Bullets condition 12; Hampton Research; added to the crystallization drop at 1/10 volume) and was subsequently validated by co-crystallization with sulfaguanidine alone (50 mM final concentration in the crystallization drop). Apart from the addition of the additive cocktail or of sulfaguanidine, co-crystallization was conducted as for isolated hBRR2 T1 crystals. After cryoprotection with 0.1 M sodium citrate, 3.0 M sodium malonate pH 7.0, 0.1 M NaCl, crystals were flash-cooled in liquid nitrogen for subsequent diffraction data collection.
Both co-crystallization in the presence of fragment derivatives and the soaking of preformed hBRR2 T1 crystals with fragment derivatives were tested to obtain additional fragment derivative-bound hBRR2 T1 crystal structures. For cocrystallization of isolated hBRR2 T1 with fragment derivatives, small-molecule stock solutions were combined with the protein solution in a 1:9 volume ratio so that the final DMSO concentration was 10%(v/v). For soaking experiments, preformed hBRR2 T1 crystals were incubated with varying concentrations of fragment derivatives and for different times. Neither co-crystallization nor soaking led to additional fragment-bound hBRR2 T1 structures (see Section 3).
For hBRR2 T1 -hJab1 ÁC complex crystals, soaking was used as a strategy to obtain additional fragment-bound structures.
Diffraction data were collected on beamline 14.1 of the BESSY II storage ring, Berlin, Germany and were processed with XDS (Kabsch, 2010). Molecular replacement with Phaser (McCoy et al., 2007) was performed in Phenix (Liebschner et al., 2019). Structures were refined by manual model building with Coot (Emsley et al., 2010) and automated refinement with phenix.refine (Afonine et al., 2012). The ligands were manually positioned into the densities, and parameters for the ligands were derived from the eLBOW tool of Phenix (Moriarty et al., 2009). In the final rounds of refinement, the B factors of the protein were refined by translation-libration-screw rotation (TLS) refinement. For the structure of hBRR2 T1 with sulfaguanidine, three TLS groups were defined, spanning residues 402-812, 813-1813 and 1814-2125 of hBRR2 T1 . For the structures of hBRR2 T1 -hJab1 ÁC in complex with compounds, one TLS group was defined for hBRR2 T1 and a second TLS group for hJab1 ÁC . The B factors of compounds, water molecules and ethylene glycol molecules were refined isotropically. In the course of the refinement, the occupancy of the compounds was set to 1.0. Model quality was assessed with MolProbity (Chen et al., 2010;Williams et al., 2018). 3D structure figures were prepared with PyMOL (Schrö dinger). Binding schematics were prepared with LigPlot (Wallace et al., 1995). Crystallographic parameters are provided in Table 1.

Docking and analogue search
Template-guided docking was carried out with FlexX (Rarey et al., 1996;Metz et al., 2021) using the crystallographic binding pose of sulfaguanidine or a substructure as the template. The FlexX algorithm deconstructs every docked compound into smaller substructures, aligns the best matching substructures to the corresponding substructures of the template and reconstructs the ligand while allowing it to flexibly explore the environment of the binding pocket. Searching for suitable analogues that contain sulfaguanidine as a substructure yielded 52 analogues from MolPort and 315 from the National Cancer Institute Developmental Therapeutics Program (NCI/DTP; https://dtp.cancer.gov). Searches were conducted using the MolPort Chemical Search node (SIA MolPort, Latvia) within Konstanz Information Miner (KNIME) version 3.4.0 (Berthold et al., 2008;Kim et al., 2016). To increase the number of suitable candidates, we included all tautomeric and protonation states of the interacting sulfaguanidine core motif [N-(aminomethyl)methanesulfonamide and N 0 -methylsulfonylmethanimidamide] as query structures, resulting in the retrieval of 187 and 22 additional compounds from MolPort and NCI/DTP, respectively. However, we noticed that the MolPort web interface still returned additional results compared with the KNIME node, presumably due to an internal normalization of the query structures provided as SMILES strings in KNIME. Extending the comprehensive search to the MolPort web interface yielded 396 suitable analogues from MolPort.
To facilitate template-guided docking of additional analogues that contain the core motif but not the complete  .7 (6.8) 6.7 (6.4) 6.7 (6.8) 6.7 (6.4) 6.7 (6.5) 6.7 (6.9) 6.5 (5.6) 10.5 (10.5) 6.7 (6.9) 6.8 (6.9) Data quality Intensity where hI(hkl)i is the mean intensity of symmetry-equivalent reflections hkl, I i (hkl) is the intensity of a particular observation of hkl and N(hkl) is the number of redundant observations of reflection hkl (Diederichs & Karplus, 1997). ‡ CC 1/2 = ðhI 2 i À hIi 2 Þ=ðhI 2 i À hIi 2 þ 2 " , where 2 " is the mean error within a half data set (Karplus & Diederichs, 2012). § Root-mean-square deviation. } Calculated with MolProbity (Williams et al., 2018). † † Clashscore is the number of serious steric overlaps (>0.4) per 1000 atoms (Williams et al., 2018). sulfaguanidine moiety as a substructure, we used six substructures of sulfaguanidine in its crystallographically observed pose (Figs. 1b and 1c; Supplementary Fig. S1) as templates. Only the three best-scoring docking poses of each compound were retained, resulting in the expected overlap of the docked analogues with the crystallographic sulfaguanidine pose for all possible cases. Compounds that could not be docked due to an unavoidable steric overlap with the protein did not result in a docking pose.
To enrich small but efficiently binding ligands, all docked poses were then rescored with DrugScore DSX (Neudert & Klebe, 2011) using default settings and were ranked by the DSX score normalized by the number of non-H atoms. All poses were visually assessed by visualization with the PyMOL session produced by DSX, highlighting interactions and the contribution of each atom to the total DSX score.
The primary aim of the above candidate selection was to identify extended sulfaguanidine analogues that form additional interactions and may exhibit enhanced affinities. However, due to the limited pool of selected compounds, we included additional compounds that (i) are of a similar size to or even smaller than sulfaguanidine yet may elucidate which interactions are crucial for or may improve the binding pose of sulfaguanidine, (ii) extend from the sulfaguanidine pose but do not form optimal interactions in the docking pose yet may do so upon induced-fit adjustment of the binding site that was not considered during docking or (iii) contain chemical motifs that are potentially unsuited for testing in functional assays due to chemical reactivity or unspecific activity, for example pan-assay interference compounds (Baell & Nissink, 2018) or imines, yet may elucidate potential interactions if bound in a crystal structure. The selected compounds are listed in Supplementary Table S1.

Small-molecule stocks
Small molecules were obtained from NCI/DTP (https:// dtp.cancer.gov), purchased from MolPort or collected from an in-house fragment library stock. All substances were dissolved in 100% DMSO at concentrations ranging from 250 mM to 1 M depending on their solubility. Small-molecule stocks were stored at À20 C until further use.

RNA duplex-unwinding assays
RNA duplex-unwinding assays were performed with a U4*/ U6 di-snRNA substrate (U4*, [ 32 P]-labelled U4 snRNA) that was prepared as described previously (Theuser et al., 2016). 200 nM hBRR2 T1 was incubated with or without compounds [at a concentration of 1 mM in the initial screen and at the indicated concentrations in IC 50 evaluation assays; final DMSO concentration of 4 or 10%(v/v)] for 3 min at 30 C in unwinding buffer [40 mM Tris-HCl pH 7.5, 50 mM NaCl, 0.5 mM MgCl 2 , 15 ng ml À1 acetylated BSA, 1 U ml À1 RNasin (Molox), 8%(v/v) glycerol]. Reactions were started by the addition of 1.7 mM ATP/MgCl 2 and 10 ml samples were taken at defined time points and mixed in a 1:1 ratio with 40 mM Tris-HCl pH 7.4, 50 mM NaCl, 25 mM EDTA, 1%(w/v) SDS, 10%(v/v) glycerol, 0.05%(w/v) xylene cyanol, 0.05%(w/v) bromophenol blue. The samples were loaded onto 4% native PAGE. After electrophoresis at 170 V and 4 C for 2-3 h, the gels were transferred to a filter paper, dried and used for the exposure of a storage phosphor screen. Detection of radioactive bands was performed by scanning with a Typhoon phosphorimager (GE Healthcare).

Initial fragment hit and identification of derivatives
In this study, we worked with an N-terminally truncated version of the human spliceosomal RNA helicase, hBRR2, which lacks the auto-inhibitory N-terminal region of the enzyme (residues 395-2129; hBRR2 T1 ) and exhibits enhanced RNA helicase activity compared with the full-length protein (Santos et al., 2012). We identified sulfaguanidine [4-amino-N-(aminoiminomethyl)-benzolsulfonamide], a known antibacterial drug (Abidi et al., 2018), as a ligand of hBRR2 T1 in a crystallization screen with additives (Silver Bullets condition 12; Hampton Research, catalogue No. HR2-096). In this hBRR2 T1 crystal structure, sulfaguanidine is positioned at the interface between the two helicase cassettes, directly contacting the N-terminal HB and IG domains as well as the C-terminal RecA2 domain (Figs. 1a-1c). The binding involves multiple hydrogen bonds, such as those between the main chain of Tyr1238 of the N-terminal IG domain and Gln1707 of the C-terminal RecA-2 domain and the sulfonyl O atom (Fig. 1c). The para-positioned amino group forms a hydrogen bond to Glu1097 of the HB domain and the guanidine moiety is contacted by a hydrogen bond to the main chain of His1236 of the IG domain. Furthermore, the binding is supported by -stacking interactions with Tyr1238 and van der Waals interactions of the benzene ring with Trp1222 (N-terminal IG domain) and Asn1531 (N-terminal RecA2 domain; Fig. 1c). Sulfaguanidine is contacted by a multitude of interactions including domains of both helicase cassettes.
It has been shown that only the N-terminal cassette (NC) of BRR2 is an active helicase, whereas the C-terminal cassette (CC) is inactive in unwinding and ATP hydrolysis (Kim & Rossi, 1999;Santos et al., 2012). The CC is assumed to regulate the activity of the NC via the interface between the two cassettes, and it has been shown that residue exchanges that affect this interface influence the activity of the NC (Santos et al., 2012;Vester et al., 2020). Inhibitors of hBRR2 have recently been reported which bind at the interface between the helicase cassettes (Iwatani-Yoshihara et al., 2017;Ito et al., 2017). Sulfaguanidine occupies a different binding site displaced by about 9 Å from BRR2-inhibitory compounds ( Supplementary Fig. S2) and did not influence the RNA helicase activity of hBRR2 T1 (Supplementary Fig. S3). However, as sulfaguanidine qualifies as a fragment (molecular weight 214.2 g mol À1 ) and exhibits low lipophilicity (c logP = À1.1; Enhanced NCI Database Browser 2.2), further modification may be possible to yield hBRR2-modulating substances. We therefore set out to explore whether sulfaguanidine research papers Acta Cryst. (2023). D79, 304-317 derivatives can be identified which exhibit altered binding and possibly hBRR2-modulatory activities.
Based on the structure of the hBRR2 T1 -sulfaguanidine complex, we carried out structure-guided docking of sulfaguanidine derivatives. Sulfaguanidine as well as six substructures of sulfaguanidine in its crystallographically observed pose ( Supplementary Fig. S1) were used as templates for guided docking with the aim of identifying molecules that might exhibit a similar binding mode. 93 compounds with favourable docking poses were ordered from the NCI/DTP, purchased from MolPort or collected from an in-house fragment library stock (Supplementary Table S1; Supplementary Fig. S4). We attempted to co-crystallize hBRR2 T1 with the derivatives under conditions that yielded hBRR2 T1 -sulfaguanidine crystals or to soak derivatives into preformed hBRR2 T1 crystals. However, neither strategy yielded hBRR2 T1 -derivative structures, possibly because crystal formation was impaired by the high fragment concentrations in some co-crystallization trials or because the high-salt crystallization conditions (including 1.5 M malonate) may have counteracted fragment binding during co-crystallization or soaking. Crystal soaking was also hampered by the deterioration of hBRR2 T1 crystals upon soaking with DMSO-containing solutions.

Binding of sulfaguanidine derivatives to conformationally rearranged hBRR2 T1
hBRR2 or hBRR2 T1 can bind to the C-terminal Jab1/MPN domain (residues 2064-2335) of the hPRPF8 protein (Mozaffari-Jovin et al., 2013;Maeder et al., 2009). When the C-terminal tail of the hPRPF8 Jab1/MPN domain extends into the RNA-binding tunnel of hBRR2, the domain acts as an inhibitor of the helicase; when the tail is removed, the hPRPF8 Jab1/MPN domain activates hBRR2 helicase activity, an effect that can be mimicked by a C-terminally truncated version of the domain (residues 2064-2320; hJab1 ÁC ; Mozaffari-Jovin et al., 2013). We have recently reported a crystal structure of a hBRR2 T1 -hJab1 ÁC complex in which the hBRR2 T1 helicase cassettes adopted a different relative orientation compared with that in isolated hBRR2 T1 (Fig. 1d; PDB entry 6s8q; Vester et al., 2020). In particular, the sulfaguanidine-binding surfaces of hBRR2 T1 are rearranged in the hBRR2 T1 -hJab1 ÁC crystals, with increased distances between residues of the two cassettes that contact sulfaguanidine in the hBRR2 T1 -sulfaguanidine structure (Fig. 1e). In the following, the conformation of hBRR2 in the monomeric hBRR2 T1 crystal structure will be referred to as conformation 1, whereas the conformation of hBrr2 in the hBRR2 T1 -hJab1 ÁC complex crystals will be referred to as conformation 2.
As the crystallization condition of the hBRR2 T1 -hJab1 ÁC complex [0.1 M HEPES-NaOH pH 8.0, 0.1 M MgCl 2 , 8%(w/v) PEG 3350] appeared to be more suitable for fragment binding, and as hBRR2 T1 -hJab1 ÁC crystals grown under these conditions were stable in solutions of 10%(v/v) DMSO, we decided to explore whether, and if so how, sulfaguanidine and derivatives might bind to the hBRR2 T1 -hJab1 ÁC complex with rearranged helicase cassettes. To this end, we selected sulfaguanidine and a subset of 55 compounds from our collection of sulfaguanidine derivatives, which was enriched in higher structural similarity to sulfaguanidine, for crystallographic binding studies using the hBRR2 T1 -hJab1 ÁC crystals. Sulfaguanidine itself did not bind to the hBRR2 T1 -hJab1 ÁC crystals. However, crystals soaked with nine derivatives yielded electron densities indicating clear binding of the substances (Supplementary Fig. S5; crystallographic parameters are listed in Table 1).
All bound compounds exhibited the same 4-aminophenyl sulfonamide scaffold (Fig. 2). A total of six pockets were identified as binding sites of the ligands (Fig. 3a). Five compounds (24, 50, 76, 78 and 86) bound only to a single pocket, two fragments (26 and 34) bound at two pockets and two substances (18 and 39) bound at three pockets simultaneously. Density for an additional molecule of compound 18 bound at two neighbouring hBRR2 T1 -hJab1 ÁC complexes in the crystal lattice emerged during later stages of refinement; as it depended on the crystal packing, this binding site was excluded from further analysis.
Four of the compounds (18, 26, 34 and 39) bound at the cassette interface regions of the original sulfaguanidine binding site of hBRR2 T1 (referred to as pocket 1; Fig. 3a). All four fragments that occupied pocket 1 exhibited a similar binding pose as sulfaguanidine in the hBRR2 T1 structure (Fig. 3a). Six fragments bound additionally (18, 34 and 39) or alternatively (76, 78 and 86) at pocket 2, located within the N-terminal RecA1 domain, with two different binding modes (binding mode 1 for compounds 18, 34 and 39; binding mode 2 for compounds 76, 78 and 86; Fig. 3a). Two fragments, compounds 24 and 50, bound exclusively in pocket 3 located at the interface between the helicase cassettes and including residues from the N-terminal IG domain and the C-terminal RecA2 and WH domains, albeit with a distance of approximately 21 Å to pocket 1 (Fig. 3a). Compounds 24 and 50 again exhibited different binding poses. Compound 18 exhibited a third binding site, pocket 4, corresponding to the C-terminal ATP-binding site of hBRR2 T1 (Fig. 3a). Pocket 5 is located on the surface of the N-terminal RecA1 domain and was bound by compound 26 (Fig. 3a). Finally, compound 39 not only bound to pockets 1 and 2, but also occupied pocket 6 between the N-terminal IG domain of hBRR2 T1 and hJab1 ÁC (Fig. 3a).
The lack of binding of sulfaguanidine in the hBRR2 T1 -hJab1 ÁC complex structure (conformation 2) is most likely due to the altered binding site in this complex compared with the isolated hBRR2 T1 structure (conformation 1; Fig. 1e), which results in decreased affinity of the compound. Therefore, this finding suggests that the derivatives that did bind at pocket 1 exhibit improved binding to the respective part of the pocket compared with sulfaguanidine. These observations show that some of the derivatives exhibit sufficient complementarity to parts of the original sulfaguanidine binding site to remain associated with regions of the original sulfaguanidine binding site upon conformational rearrangement. Remarkably, however, the nine derivatives also bound at entirely new binding pockets and adopted diverse binding poses in these pockets.

research papers 3.3. Conformation-dependent binding hot spots in hBRR2 T1
It has been suggested that the tendency of fragment derivatives to retain the binding mode of the original hit depends on whether the binding site represents a binding hot spot (Kozakov, Hall, Jehle et al., 2015). Hot spots are regions of a protein that contribute a substantial fraction of binding free energy upon interaction with a ligand and thus can offer a binding site to diverse fragments . Hot spots have been experimentally delineated, for example, via the identification of hot spot residues by alanine-scanning mutagenesis (DeLano, 2002) or by determining crystal structures after soaking with organic solvents (Mattos & Ringe, 1996). In good agreement with experimental data, hot spots can also be predicted with the FTMap server , which calculates the interaction energy of 16 probe molecules upon binding protein surface areas (Kozakov, Grove et al., 2015). Probe molecules that bind to overlapping sites constitute a cluster, which identifies a hot spot. The clusters are ranked by the number of interacting probe molecules .
We used the FTMap server to predict hot spots on hBRR2 T1 in the two conformations observed in the isolated hBRR2 T1 and the hBRR2 T1 -hJab1 ÁC crystal structures (Figs. 3b and 3c). 13 hot spots were predicted for isolated hBRR2 T1 (Fig. 3b). The primary hot spot (16 interacting probes) is located at the interface between the cassettes, coinciding with the position of known hBRR2 inhibitors (Iwatani-Yoshihara et al., 2017;. A second hot spot (13 interacting probes) overlaps with the original sulfaguanidine-binding site (Fig. 3b). This hot spot is surrounded by three other hot spots (ranks 6, 7 and 11). Additional hot spots were predicted near the ATP-binding pockets of the NC (rank 3) and CC (rank 4), and the remaining hot spots are distributed across hBRR2 T1 (Fig. 3b).
The nature and ranks of the hot spots predicted for hBRR2 T1 in conformation 2 observed in the hBRR2 T1 -hJab1 ÁC complex differed markedly (Fig. 3c). While the primary hot spot was again predicted at the cassette interface, it overlapped with the binding sites of compounds 24 and 50 at pocket 3 (Figs. 3c and  3d). Regions corresponding to the original sulfaguanidine binding site in conformation 1 were not predicted to be hot spots in conformation 2 of the hBRR2 T1 -hJab1 ÁC complex. Instead, the N-terminal nucleotide-binding pocket is the second ranked predicted hot spot (Fig. 3c). A new predicted hot spot (rank 3, nine interacting probes) is located in the N-terminal RecA1 domain (Fig. 3c), corresponding to binding pocket 2 (Fig. 3d) that we identified as the binding site for six fragments (Fig. 3a). However, while four additional hot spots were predicted at the interface of hBRR2 T1 and hJab1 ÁC , these sites differed from pocket 6 at this interface, which we observed as a binding site for compound 39 (Figs. 3c and 3d). Omitting the hJab1 ÁC structural coordinates during the FTMap server analysis did not lead to major differences in the predicted hot spots, suggesting that hot spots on the surface of hBRR2 T1 predominantly depend on the conformation of hBRR2 T1 rather than the presence of hJab1 ÁC . Overall, this   (a) The binding sites of nine fragments on the hBRR2 T1 -hJab1 ÁC complex. The crystal structures of the fragment complexes were aligned with the crystal structure of unliganded hBRR2 T1 -hJab1 ÁC (PDB entry 6s8q), and only the unliganded hBRR2 T1 -hJab1 ÁC structure is shown as a cartoon for clarity. Pocket 1 corresponds to part of the sulfaguanidine binding site in isolated hBRR2 T1 and is bound by fragments 18, 26, 34 and 39. Pocket 2 is located in the N-terminal RecA1 domain and is bound by fragments 18, 34 and 39 (pose 1) and fragments 76, 78 and 86 (pose 2). Pocket 3 is located at the interface between the cassettes at a distance of approximately 21 Å from pocket 1 and is bound by fragments 24 and 50. Pocket 4 corresponds to the ATP-binding pocket of the CC and is bound by fragment 18. Pocket 5 is located on the surface of hBRR2 T1 at the N-terminal RecA1 domain and is bound by compound 26. Pocket 6 is positioned at the interface between the N-terminal IG domain of hBRR2 T1 and hJab1 ÁC and is bound by fragment 39. Fragments are shown as sticks and coloured by atom type as in Fig. 1(c), except that fragment C atoms are coloured by fragment. (b) FTMap results for isolated hBRR2 T1 (PDB entry 4f91; grey cartoon). The FTMap server was used to predict the binding of probes to the crystal structure of isolated hBRR2 T1 . Probes that bind to the same site form a cluster (C1-C13, clusters 1-13). The clusters are ranked according to the number of predicted interacting probe molecules, with lower cluster numbers corresponding to a larger number of probes. Numbers of interacting probes are indicated in parentheses. Predicted interacting probe molecules are shown as sticks and coloured by atom type as in Fig. 1(c), except that probe molecule C atoms are coloured by probe molecule. (c) FTMap results for the hBRR2 T1 -hJab1 ÁC complex (PDB entry 6s8q; hBRR2 T1 , grey cartoon; hJab1 ÁC , dark grey cartoon). Clusters and bound probes are shown and labelled as in (b). (d) Comparison of predicted hot spots and identified fragment-binding pockets. The structures of all hBRR2 T1 -hJab1 ÁC -fragment complexes were aligned with the unliganded hBRR2 T1 -hJab1 ÁC complex structure including the FTMap-predicted clusters. Interacting fragments and FTMap-predicted interacting probe molecules are shown as sticks. Interacting fragments, cyan; FTMap-predicted interacting probe molecules, magenta. Fragment-binding pockets are labelled as in (a) and clusters are labelled as in (c). analysis suggests that predicted hot spots can change decisively when a protein undergoes conformational changes, even if the rearrangement preserves the structures of individual domains and mainly constitutes rigid-body rearrangements of domain assemblies (in this case the hBRR2 helicase cassettes). Furthermore, in the case studied here, the predicted hot spots overlap with, but are not perfectly congruent with, observed fragment-binding sites.

Weak interactions favour low binding-mode conservation
We did not find that the preferences of ligands for a specific binding pocket correlated with a trend in general physicochemical properties such as molecular mass, lipophilicity (logP value) or the number of rotatable bonds (Table 2). Therefore, we inspected the chemical details of the interactions within the pockets, which might hint at features of the ligands that explain their binding preferences.
Pocket 1, corresponding to the altered sulfaguanidine binding site, was occupied by fragments 18, 26, 34 and 39. Pocket 1 was not a predicted hot spot in this conformation 2. Compounds bound at this position are likely to exhibit some conserved interactions compared with the original sulfaguanidine binding mode. Indeed, some key interactions with residues from the C-terminal cassette remained (Fig. 4a). Among them are van der Waals interactions with Asn1531, Gly1708 and Ser1709 and a hydrogen bond to Gln1707. Nevertheless, several of the hydrogen bonds of the original sulfaguanidine binding mode are missing and the compounds predominantly exhibit rather weak hydrophobic contacts.
The ligands of pocket 2, which is located within the N-terminal RecA1 domain, exhibited two different binding poses. One pose is adopted by fragments 18, 34 and 39 and the other is adopted by fragments 76, 78 and 86 (Fig. 4b). The main conserved property of ligands 76, 78 and 86 compared with ligands 18, 34 and 39 is an additional methoxy group attached to the amino group. The van der Waals interactions of the methoxy group with Leu631, Thr597 and Arg634 are probably responsible for the alternative pose of ligands 76, 78 and 86 compared with ligands 18, 34 and 39 at pocket 2.
Ligands 24 and 50 bound exclusively at pocket 3, which is positioned at the interface between the N-terminal IG domain and the C-terminal RecA2/WH domains. Fragment 24 also engages in a few hydrophobic interactions ( Fig. 4c; the interactions of fragment 50 are not shown because of the relatively low resolution, 3.5 Å , of the corresponding complex structure).
Pockets 4, 5 and 6 were each only bound by one fragment. The position of compound 18 in pocket 4 directly overlaps with the observed position of ATP in the C-terminal ATPbinding site (Fig. 4d). Indeed, the sulfonyl group of compound 18 established interactions with key residues of ATP-binding motifs. A hydrogen bond is formed to Lys1356 of motif I (which usually contacts theand -phosphate of ATP), as well as van der Waals interactions with Gly1353, Gly1355 (part of the flexible phosphate-binding loop) and Asp1454 and Glu1455 of motif II (responsible for the coordination of magnesium and, in active helicase modules, for the activation of a water molecule for hydrolysis). Perhaps surprisingly, we did not detect binding of compound 18 to the N-terminal ATPbinding pocket. The C-terminal helicase cassette is inactive in ATP hydrolysis, the motifs deviate from those of the N-terminal cassette (Santos et al., 2012;Kim & Rossi, 1999) and the pocket of the N-terminal cassette is probably more flexible to undergo frequent ATP hydrolysis cycles (Absmeier et al., 2021); this probably contributes to the lack of binding of compound 18 at the N-terminal site.
Compound 26 bound in a groove located on the surface of the N-terminal RecA1 domain of hBRR2 T1 : pocket 5 (Fig. 4e). In this position, the para-positioned methyl group of compound 26 is involved in multiple van der Waals interactions with Gly921, Tyr922, Ala923 and Ile927.
Finally, pocket 6 was occupied by compound 39, which additionally bound at pocket 1 and pocket 2. Pocket 6 is located between hBRR2 T1 and the interaction partner hJab1 ÁC (Fig. 4f ). Here, compound 39 sustains hydrogen bonds to Asp1227 and Val1228 and van der Waals interactions with Asp1229 and Phe1259 of hBRR2 T1 and with Leu2268 and Asn2109 of hJab1 ÁC .
In conclusion, all ligands bound to hBRR2 T1 in conformation 2 appeared to be contacted mainly through van der Waals interactions, often mediated by the core benzene ring, whereas stronger types of interaction are not frequently observed. These comparably weak interactions are likely to give rise to a low binding affinity of the compounds in the respective pocket and therefore facilitate changes in binding modes.

Some fragments inhibit BRR2 helicase activity
To further test whether the bound fragments identified in our crystallographic screen might constitute interesting hits for the development of new hBRR2-modulating substances, we tested the effects of the fragments identified in co-crystal structures on hBRR2 helicase activity. hBRR2 T1 -mediated RNA unwinding in the presence of compounds compared with a DMSO control was monitored by single time-point unwinding assays of a U4/U6 di-snRNA duplex, in which the  LigPlot (Wallace et al., 1995) representation of the fragment-binding modes in the pockets of hBRR2 T1 in the hBRR2 T1 -hJab1 ÁC complex. For the location of the pockets, see Fig. 3 U4 snRNA strand was radioactively labelled. As a positive control for a substance with known effects on unwinding activity, we used a previously published hBRR2 inhibitor: compound 33a developed by Ito et al. (2017). While compound 33a efficiently inhibited hBRR2 T1mediated U4/U6 unwinding (45% of the unwinding amplitude of the DMSO control after 2 min), we did not detect any effect of sulfaguanidine on unwinding at 300 mM or 1 mM (Supplementary Fig. S3).
In conclusion, only compounds 24 and 50 clearly bound to hBRR2 T1 and additionally showed an effect on helicase activity. They also both bound to the same pocket 3. Compound 24 is known as Piloty's acid, which can produce nitroxyl and nitrous oxide (Smulik-Izydorczyk et al., 2019) and is a known inhibitor of aldehyde dehydrogenase (Nagasawa et al., 1995). However, our assays were conducted at a pH below 8.0, where such a decomposition of compound 24 is unlikely. An effect of both compounds 24 and 50 upon the binding of pocket 3 would be in line with the effect of the known inhibitors of hBRR2 that also bind in this region (Iwatani-Yoshihara et al., 2017;Ito et al., 2017). However, we cannot exclude any nonspecific effects of these compounds at the high concentrations used in biochemical assays. Further elaboration of these fragments would be required to convert them to compounds with higher potency and specificity.

Discussion
In this fragment-screening approach for the spliceosomal helicase hBRR2, multiple binding sites and binding poses were identified for ten fragments with a conserved chemical core structure. Based on the initial hit sulfaguanidine as a ligand at the interface of the helicase cassettes, nine related fragments were identified to bind to six different pockets. We subsequently identified protein and ligand properties as a basis for the drastic alterations in binding site and binding pose.
The first obvious cause for changed or abrogated binding are the conformational differences of hBRR2 T1 in the two different crystal forms (isolated hBrr2 T1 versus the hBRR2 T1 -hJab1 ÁC complex). In particular, the position of the CC relative to the NC is shifted in the hBRR2 T1 -hJab1 ÁC complex structure, so that the interface between the helicase cassettes, and thus the sulfaguanidine binding site, is affected (Fig. 1e). Therefore, it is not surprising that sulfaguanidine as well as several compounds with the same chemical core structure no longer bound to the remaining parts of the original pocket 1 in the hBRR2 T1 -hJab1 ÁC complex crystals. Surprisingly, however, five other pockets were occupied by sulfaguanidinederived compounds. Although the conformational differences between isolated hBRR2 T1 and the hBRR2 T1 -hJab1 ÁC complex strongly affect the sulfaguanidine-binding pocket 1 [root-mean-square deviation (r.m.s.d.) of 4.12 Å upon alignment of domains from pocket 1 in PDB entries 4f91 and 6s8q], the other pockets remain essentially unaltered (r.m.s.d. values of around 0.44-0.73 Å upon the alignment of the domains contributing to the pockets; Supplementary Table S2).
Previously, Kozakov and coworkers hypothesized that the hot spots of a protein can determine the binding potential and therefore the degree of binding-mode conservation for ligands (Kozakov, Hall, Jehle et al., 2015). The FTMap analyser of Kozakov, Grove et al. (2015) predicted drastically different hot spots for hBRR2 T1 in the two crystalline conformations. The cluster/hot spot of the sulfaguanidine binding site was predicted for hBRR2 T1 in the isolated conformation but not in the hJab1 ÁC complex conformation. In agreement with the altered binding site and the absence of a predicted hot spot, only four of the compounds bound at parts of pocket 1 in conformation 2. Remarkably, the FTMap server predicted a new hot spot, cluster 3, which was only present in the hJab1 ÁC complex conformation (pocket 2; Fig. 3c), consistent with the observed binding of six fragments (compounds 18, 34, 39, 76, 78 and 86) to this pocket (Fig. 3d). Other predicted clusters, 1, 10 and 12, were also occupied by tested fragments (pocket 3 by compounds 24 and 50 and pocket 4 by compound 18; Figs. 3c and 3d). In conclusion, the predicted hot spots differ drastically in both conformations, although the rigid-body alterations only have a very minor structural impact on the majority of the protein pockets, with the exception of pocket 1. In agreement, some of the predicted hot spots are occupied by fragments from our screen.
The properties of the fragments from our screen, such as size, lipophilicity and the number of rotatable bonds (Table 2), are in the typical ranges for fragments used in screens (Congreve et al., 2003). However, all nine of the bound fragments as well as the conserved core have a size of more than 150 Da. Drwal and coworkers previously noted, based on an analysis of PDB structures, that the binding mode of a fragment with molecular mass of 150 Da or above tends to be conserved (Drwal et al., 2018). In contrast to this suggestion, we observed marked binding-site and binding-pose variations with related compounds above this size limit. Although our study solely resembles an exemplary test case of the binding of ten similar fragments to a protein, it demonstrates that these unexpected observations can be made in a reasonable approach.
Clearly, the lack of strong hydrogen-bond interactions is probably one of the main causes of low binding-mode conservation, as also noted in the study by Malhotra & Karanicolas (2017). The weak van der Waals interactions of the benzene ring and methyl groups are likely to give rise to low affinity of the compounds within each binding site of hBRR2 T1 . Further moieties of this core can promote preferential binding to other pockets, which may have an improved affinity for the differently decorated fragment derivative. A remarkable example from our screen is the binding mode of compound 24 compared with compound 76. The compounds only differ in a methyl group, so that compound 24 contains a hydroxyl group that is linked to the amino group, whereas compound 76 has a methoxy group. This modification is sufficient to cause the binding of compound 24 to pocket 3 in the interface between the cassettes, whereas compound 76 is bound in pocket 2 within the N-terminal RecA1 domain.
The fragments identified as ligands in our screen could be used for further elaboration into more potent hBRR2 modulators. Due to its essential role in pre-mRNA splicing and due to the association of mutations that affect hBRR2 with the disease retinitis pigmentosa, this helicase is an interesting target for drug discovery. Fragments such as compounds 24 and 50 with initial effects on hBRR2dependent RNA unwinding or fragment 39 within an interesting pocket at the hBRR2-hJab1 ÁC interface could be starting points for such approaches.
In general, hot spots are considered to be rather insensitive to the flexibility of a protein, in contrast to, for example, the shapes of binding sites (Kozakov et al., 2011;Kozakov, Hall, Napoleon et al., 2015). However, our analysis demonstrates that the conformation of a protein can drastically influence the location and the strength of hot spots. Whereas slight side-chain variations and a certain pocket flexibility are not decisive in most cases (Kozakov, Hall, Napoleon et al., 2015), hot-spot predictions for proteins with high intrinsic flexibility should therefore take into account the known spectrum of structural variability. Indeed, conformational dynamics can be critical for the druggability of a protein and some potential binding sites might only be detected in nonstatic representations such as molecular-dynamics simulations, as demonstrated, for instance, for Bcl-xL (Brown & Hajduk, 2006). Also, in the case of CDK2 it was shown that the binding mode of known inhibitors differed drastically between two crystal forms: the monomeric inactive conformation of CDK2 and the active CDK2-cyclin complex form (Kontopidis et al., 2006). Some inhibitors of CDK2 bound more strongly to the active complex, while others preferentially bound to the inactive monomeric form, so that structure-activity relationship data derived exclusively from one of the two forms could be misleading (Kontopidis et al., 2006).
Clearly, only hot spots in physiologically relevant conformations are important and it has to be considered for FBDD that crystal structures may exhibit irrelevant binding sites. Especially for methods that assess ligand binding in solution, differences in hot spots within different simultaneously present conformations of the target protein should be considered. Observed binding of derivatives could potentially be related to changes in binding modes, which should be carefully validated by structural techniques at each step of fragment elaboration into a lead. Nevertheless, new binding modes might enable to explore the chemical space in novel, previously unidentified directions.

Related literature
The following reference is cited in the supporting information for this article: Sander et al. (2015).